Optimal Data Mining Classification Algorithm for Bio Medicinal Facts

نویسندگان

  • B. Madasamy
  • Dr. J. Jebamalar Tamilselvi
چکیده

The most significant application of bio medicinal data analysis is used to determine or classify the unknown samples with the help of known samples to make a decision. Several general purpose data mining classification techniques have been proposed to identify the bio medicinal patterns. Because of their high dimensionality, multiple classes, unlearned data and hidden values, bio medicinal datasets pretense a distinctive challenge of machine learning and data mining algorithms for classification. This paper endow with a complete evaluation of a set of diverse machine learning schemes on a number of bio medicinal datasets. The intend of this paper shows that data mining can be applied to the medicinal databases, which will predict or classify the data with a reasonable accuracy also find the optimal classification algorithm for bio medicinal data sets. It focused to study a few classification techniques such as Support Vector Machine (SVM), Nearest Neighbor Classifier (k-NN), Decision Tree Induction, Navie Bayes, C4.5, Genetic Algorithm, Rule based algorithm and Back Propagation through bio medicinal data bases. These Classification Techniques applies to UCI, KDD, PUBMED and publicly available in common medicinal datasets, and compared how these classification techniques performed in class prediction of test datasets to find the best classifier for bio medicinal identification. The performance of these classifiers can evaluate in terms of minimum threshold level to identify the bio medicinal pattern, subset merit, noise, imbalance ratio, missing values, execution time, accuracy of an algorithm, correlation-based feature selection information gain, training time, memory usage, and memory utilization has been analyzed. The effect of these complexity measures on classification accuracy is evaluated using above mentioned data mining machine learning algorithms. Keywords— Bio medicinal, Machine learning, UCI, KDD, Lazy learner, SVM, Back propagation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimizing the Grade Classification Model of Mineralized Zones Using a Learning Method Based on Harmony Search Algorithm

The classification of mineralized areas into different groups based on mineral grade and prospectivity is a practical problem in the area of optimal risk, time, and cost management of exploration projects. The purpose of this paper was to present a new approach for optimizing the grade classification model of an orebody. That is to say, through hybridizing machine learning with a metaheuristic ...

متن کامل

Assessment of approximate string matching in a biomedical text retrieval problem

Text-based search is widely used for biomedical data mining and knowledge discovery. Character errors in literatures affect the accuracy of data mining. Methods for solving this problem are being explored. This work tests the usefulness of the Smith-Waterman algorithm with affine gap penalty as a method for biomedical literature retrieval. Names of medicinal herbs collected from herbal medicine...

متن کامل

Congestion Management through Optimal Allocation of FACTS Devices Using DigSILENT-Based DPSO Algorithm- A Real Case Study

Flexible AC Transmission Systems (FACTS) devices have shown satisfactory performance in alleviating the problems of electrical transmission systems. Optimal FACTS allocation problem, which includes finding optimal type and location of these devices, have been widely studied by researchers for improving variety of power system technical parameters. In this paper, a DIgSILENT-based Discrete Parti...

متن کامل

Optimal Placement and Sizing of TCSC & SVC for Improvement Power System Operation using Crow Search Algorithm

Abstract: The need for more efficient power systems has prompted the use of a new technologies includes Flexible AC transmission system (FACTS) devices. FACTS devices provides new opportunity for controlling the line power flow and minimizing losses while maintaining the bus voltages within a permissible limit. In this thesis a new method is proposed for optimal placement and sizing of Thyristo...

متن کامل

Data mining for decision making in engineering optimal design

Often in modeling the engineering optimization design problems, the value of objective function(s) is not clearly defined in terms of design variables. Instead it is obtained by some numerical analysis such as FE structural analysis, fluid mechanic analysis, and thermodynamic analysis, etc. Yet, the numerical analyses are considerably time consuming to obtain the final value of objective functi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013